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distribution  is  quoted,  permission  has  been  obtained  to  use  the 
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_  Citations  of  commercial  organizations  and  trade  names  in 

this  report  do  not  constitute  an  official  Department  of  Army 
endorsement  or  approval  of  the  products  or  services  of  these 
organizations. 

_  In  conducting  research  using  animals,  the  investigator (s) 

adhered  to  the  "Guide  for  the  Care  and  Use  of  Laboratory 
Animals,"  prepared  by  the  Committee  on  Care  and  use  of  Laboratory 
Animals  of  the  Institute  of  Laboratory  Resources,  National 
Research  Council  (NIH  Publication  No.  86-23,  Revised  1985) . 

_  For  the  protection  of  human  subjects,  the  investigator (s ) 

adhered  to  policies  of  applicable  Federal  Law  45  CFR  46. 

In  conducting  research  utilizing  recombinant  DNA  technology, 
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the  National  Institutes  of  Health. 

In  the  conduct  of  research  utilizing  recombinant  DNA,  the 
'investigator (s)  adhered  to  the  NIH  Guidelines  for  Research 
Involving  Recombinant  DNA  Molecules . 
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Introduction 

The  goal  of  this  project  is  postdoctoral  training  for  Dr.  Robert  Nolte  in 
the  area  of  structural  biology  as  applied  to  problems  in  cancer  and  to  breast 
cancer  in  particular.  The  report  below  corresponds  to  a  revised  SOW, 
formally  requested  in  a  submission  to  Dr.  Patricia  Modrow  in  a  letter  dated 
August  8,  1996.  The  revised  SOW  had  been  described  and  justified  in  a 
revised  Progress  Report  for  1994-1995  submitted  in  January,  1996;  approval 
received  in  June,  1996  . 

The  Problem:  Cancer  involves  anomalies  in  cellular  regulation.  Two  key 
regulatory  steps  are  the  transduction  of  signals  from  the  cell  surface  and  the 
control  of  specific  gene  expression.  The  estrogen  receptor,  on  which  the 
original  SOW  for  this  grant  focused,  participates  in  both  these  steps.  It 
receives  hormonal  signals  and  in  response  activates  defined  genes.  The 
recently  discovered  BCRAl  gene  appears  to  encode  a  nuclear  protein  that  may 


regulate  gene  expression  (Miki  et  al,  1994).  Progress  in  structural  studies  of 
tyrosine-kinase  mediated  signaling  pathways  and  protein/ DNA  complexes 
that  regulate  gene  expression  makes  the  possibility  of  structure-based  drug 
discovery  and  development  a  real  one,  but  many  basic  principles  remain  to  be 
discovered. 

Previous  work,  (i)  A  number  of  structural  studies  of  protein/ DNA 
complexes  involved  in  transcriptional  regulation  have  provided  the 
guidelines  of  how  to  think  about  specific  recognition  (see,  for  example, 
Harrison,  1991  and  Steitz,  1990).  An  important  notion  to  emerge  both  from 
the  structural  studies  and  from  studies  of  transcriptional  regulation  at  many 
complex  promoter/ enhancer  sites  is  that  large  multi-protein  complexes 
boimd  to  DNA  are  critical  for  the  sort  of  combinatorial  control  seen  in  cells  of 
all  higher  eukaryotes  (and  in  human  cells  in  particular).  Work  in  our 
laboratory  on  the  polymerase  El  transcription  factor  known  as  TFITfA  has  led 
to  crystallization  of  a  complex  between  a  part  of  the  factor  containing  six  "zinc 
fingers"  (each  finger  is  a  distinct  DNA-binding  domain)  and  a  31  BP  DNA  site. 
This  is  a  larger  complex  than  any  so  far  studied  from  this  class  of 
transcriptional  regulatory  proteins.  TFIIIA  is  unusual  among  transcription 
factors  in  that  it  binds  not  only  to  sites  that  regulate  gene  expression  but  also 
to  the  RNA  transcript.  A  similar  double  specificity  has  recently  been  observed 
for  the  product  of  the  Wilms  tumor  suppressor  gene,  WTl  (Caricasole  et  al, 
1996).  (ii)  Protein  tyrosine  kinases  are  responsible  for  many  key  signaling 
events  in  normal  and  cancer  cells.  The  kinases  related  to  Src  have  been 
especially  widely  studied,  in  part  because  of  the  early  importance  of  Src  in 
revealing  the  existence  of  oncogenes  and  oncogene  products.  We  have 
determined  the  structure  of  the  Src-homology  2  (SH2)  and  Src-homology  3 
(SH3)  domains  of  the  Lck  kinase,  and  we  are  engaged  in  an  effort  to  crystallize 
the  auto-inhibited  form  of  the  full  Lck  and  Src  proteins.  An  important 
adjunct  to  this  effort,  which  could  spawn  a  number  of  drug-design  strategies. 
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will  be  to  apply  the  expression  approaches  developed  for  Src/Lck  to  the 
protein  that  is  "next"  in  the  Lck-initiated  pathway  of  T-cell  signaling,  ZAP-70. 

Purpose  of  present  work.  The  broad  goal  is  to  enhance  our  understanding  of 
specific  protein-protein  and  protein-DNA  interactions  in  gene  regulation  and 
in  signal  transduction.  In  particular,  we  are  concluding  the  structure 
determination  of  a  complex  between  a  portion  of  the  DNA-binding  segment 
from  the  polymerase  m  regulatory  factor  TFIIIA  and  its  cognate  site.  We  are 
also  building  upon  our  own  recent  success  in  structural  studies  of  regulatory 
domains  from  Src-like  protein  kinases  to  establish  a  broader  program.  Thus, 
Dr.  Nolte  will  first  complete  the  TFIILA/DNA  structure  determination  (in 
collaboration  with  Dr.  Ray  Brown  of  this  laboratory),  a  task  that  should  be 
completed  in  the  next  few  months.  He  will  then  focus  his  efforts  on 
expanding  the  preliminary  ZAP-70  studies  The  cancer  relevance  of  these 
studies  is  described  in  the  body  of  the  report. 

Methods.  All  of  the  projects  described  here  involve  expression  of  proteins,  or 
of  specific  protein  domains,  in  order  to  crystallize  them  for  investigation  by  X- 
ray  diffraction.  X-ray  methods  are  the  best  way  to  determine  the  detailed 
atomic  structure  of  a  protein,  a  necessary  pre-requisite  for  molecular  analysis 
of  fimction  and  drug-targeting  strategies. 

Body  of  Narrative 
TFniA: 

The  transcription  factor  IHA  (TFIIIA)  from  Xenopus  laevis  binds  both  DNA 
and  RNA.  The  discovery  of  multiple  zinc  finger  motifs  in  TFIIIA  pointed  to  a 
complex  mode  of  protein-DNA  interaction.  Each  of  these  motifs  is  an 
independently  folded  protein  domain  that  recognizes  a  specific  DNA 
sequence  of  three  base  pairs.  Nine  tandem  zinc  fingers  are  present  in  TFIIIA, 
allowing  the  protein  to  bind  to  an  extensive  region  of  the  5S  RNA  gene 
promoter. 
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The  first  six  zinc  fingers  of  Th'lllA  have  been  expressed  in  E.coli  and 
reconstituted  with  a  duplex  of  two  31  base-pair  synthetic  DNA  strands  (shown 
in  Figure  la,b)  whose  sequence  derives  from  the  Xenopus  oocyte  5S  RNA 
gene  promoter  .  Crystals  of  the  complex  were  grown  using  "native"  DNA  and 
one  of  two  different  substituted  DNAs,  in  which  three  thymidine  nucleotides 
were  modified  to  contain  iodine  at  specific  locations.  Six  complete  data-sets 
were  collected  on  two  trips  to  the  National  Synchrotron  Light  Source  at 
Brookhaven  Laboratory  during  the  last  year  and  the  diffraction  statistics  of 
these  data  sets  are  summarized  in  Table  1.  These  data-sets  were  collected 
from  either  native  or  derivative  crystals  with  the  data  in  some  cases 
extending  to  3.15A.  Anisotropy  in  the  data  limits  the  effective  resolution  to 
approximately  3.5A.  Difference  Patterson  maps  have  been  solved,  and  the 
position  of  the  iodine  atoms  determined.  From  these  iodine  positions,  low 
resolution  phases  have  been  calculated,  and  Dr.  Nolte  has  been  working  to 
extend  these  phases  to  the  resolution  limit  of  data.  The  initial  and  current 
phasing  statistics  are  summarized  in  Table  2.  The  current  phases  allow  the 
calculation  of  an  electron  density  map  in  which  the  DNA  can  clearly  be  seen. 
Dr.  Nolte  has  built  the  31  base  pairs  of  DNA  into  the  density  as  illustrated  in 
Figure  2.  Density  is  dso  visible  for  the  protein,  as  shown  in  Figure  3,  and  he 

has  tentatively  built  a  Ca  trace  of  183  of  the  190  residues  contained  in  this 
construct.  An  overview  of  the  current  structure  at  approximately  4.5A 
resolution  shows  the  arrangement  of  the  6  fingers  on  the  DNA  (Figxure  4).  A 
novel  interaction  between  one  of  the  fingers  and  the  minor  groove  of  the 
DNA  is  apparent.  This  interaction  may  give  us  insight  into  the  mechanism 
by  which  TFllIA  is  able  to  bind  to  DNA.  The  molecular  details  describing  the 
interaction  between  individual  atoms  of  the  protein  and  DNA  will  be 
available  in  the  next  few  months  as  the  higher  resolution  phases  are 
determined  and  the  final  model  is  refined. 

Crystals  will  be  grown  for  an  additional  data  collection  trip  to  Brookhaven, 
which  will  occur  in  late  October  1996,  with  the  intention  of  extending  the 
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resolution  of  this  structure  so  that  the  finer  details  of  the  DNA  /  protein 
interaction  can  be  determined. 

Many  genes  with  key  roles  in  cancer  are  regulated  by  zinc-finger  containing 
proteins.  Understanding  the  principles  of  DNA  recognition  by  members  of 
this  family  will  be  important  for  establishing  a  picture  of  their  transcriptional 
control.  In  particular,  the  feasibility  of  efforts  to  find  drugs  that  target  gene- 
specific  transcriptional  regulation  will  depend  on  such  understanding.  The 
TFIHA  structure,  when  completed,  will  contain  the  longest  strand  of  DNA 
and  more  zinc  fingers  than  has  been  observed  in  any  other  structure  to  date. 
It  will  greatly  expand  the  database  of  interactions,  and  it  has  already  suggested 
a  novel  binding  interaction  to  the  minor  groove  of  DNA,  not  previously 
observed. 

Src-family  kinases. 

We  have  determined  the  structures  of  the  regulatory  domains  of  the 
lymphocyte  t)^osine  kinase,  Lck  (Eck,  et  al,  1993;Eck,  et  al,  1994),  and  members 
of  the  laboratory  are  currently  engaged  in  efforts  to  crystallize  the  auto- 
inhibited  state  of  the  full-length,  multi-domain  molecule  of  Lck  and  Src.  Src- 
like  kinases  of  this  sort  are  important  in  a  variety  of  tumors,  and  they  appear 
to  modify  proteins  that  have  a  role  in  breast  cancer  (see,  for  example,  Arnold 
et  al,  1995).  Upon  completion  of  the  TFIIIA  project.  Dr.  Nolte  will  build  upon 
the  work  on  Lck /Src  kinases  by  examining  the  structure  of  the  immediate 
downstream  target  of  Lck  -  another  tyrosine  kinase  known  as  ZAP-70,  which 
contains  two  SH2  domains  and  a  kinase  domain.  Preliminary  studies 
involving  the  expression  and  purification  of  ZAP-70  have  been  carried  out, 
and  a  small  amount  of  protein  has  been  purified  to  demonstrate  the 
feasibility  of  this  project.  ZAP-70  kinase  was  expressed  and  purified  using  an 
Sf-9/Baculovirus  expression  system  as  follows:  A  fragment  encoding 
residues  1  to  619  of  human  ZAP-70  tyrosine  kinase  with  an  N-terminal 
polyhistidine  tag  was  subcloned  into  the  pVL1392  and  pBlueBac4  (Invitrogen) 
transfer  vectors  (Pst  I  and  EcoRI  restriction  sites)  using  standard  PCR-based 
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methods.  Recombinant  Baculovirus  was  obtained  by  cotransforming  Sf-9 
cells  with  linearized  viral  DNA  (Baculogold,  Pharmingen)  and  the  transfer 
vector.  High-titer  viral  stocks  (~109)  were  prepared  from  plaque-purified 
recombinant  virus.  For  protein  production,  Sf-9  cells  were  cultured  in  2  L 
Spinner  flasks  in  TNM/FH  media  supplemented  with  10  %  serum.  Cells 
were  infected  with  25  ml/liter  viral  stocks  and  harvested  64-72  hours  post¬ 
infection  by  centrifugation.  Cells  were  lysed  by  sonication  and  the  lysates 
cleared  by  high  speed  centrifugation.  The  resulting  supernatant  was  loaded 
onto  a  Nickel-NTA  column,  washed  and  eluted  with  200mM  imidazol.  The 
ZAP-70  containing  fractions  were  further  purified  by  ion  exchange  on  SP- 
Sepharose  colxunn,  followed  by  gel  filtration  on  Superdex-200  column  to 
achieve  >  95%  pure  protein.  Fractions  containing  ZAP-70  were  identified  by 
SDS-PAGE  and  Western  Blotting  (anti-human  ZAP-70)  during  the 
purification. 

Conclusion; 

The  crystallographic  study  of  a  complex  between  (TFIIIA  fingers  1-6)  and 
cognate  DNA  will,  when  completed,  yield  significant  new  information  about 
modes  of  specific  DNA  recognition  by  this  family  of  transcription  factors. 
Other  work  in  our  laboratory  on  gene  regulation  at  complex 
promoter/ enhancers  will  then  be  poised  to  capitalize  on  the  results.  Dr. 
Nolte  will  himself  corriplete  this  period  of  training  and  prepare  to  launch 
independent  directions,  by  turning  his  attention  to  structural  analysis  of 
signal  transduction  pathways.  Parallel  efforts  in  our  laboratory  have  been 
successful  at  expression  and  crystallization  of  domains  of  the  Lck  and  Src 
kinases,  and  we  will  expand  this  effort  to  include  the  downstream  "effector" 
of  Lck,  ZAP-70. 
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A)  Sequence  of  the  two  31  base  pair  oligonucleotides  contained 
in  the  TFIIIA  crystal  structure 
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Figure  2 

Electron  Density  Map  of  TFIIIA  ;  DNA  Structure 
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Figure  4 

TFIIIA  Structure  :  DNA  /  Protein  Organization 
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Table  1 :  Summary  of  X-ray  Diffraction  Statistics  for  TFIHA  Data  Sets 


Name 

N23 

N13 

Derivative 

Native 

Native 

Wavelength  (A) 

1.283 

1.283 

Resolution 

(High) 

3.10  A 

3.16A 

R-Factor  (High) 

5.1% 

14.2% 

Resolution 

(Nominal) 

3.20A 

3.50A 

R-Factor 

(Nominal) 

4.97% 

10.9% 

I  /  Sigma 
(Average) 

15.3 

8.0 

Completeness 

92.0% 

59.5% 

Frames  Collected 

157 

140 

Frame  Width 
(deg) 

1.25 

1.50 

Dataset 

Multiplicity 

2.04 

1.69 

N1 

RB26 

RB27 

RBI3 

RB57 

Native 

BCF 

BCF 

BCE 

BCE 

1.542 

1.283 

1.283 

1.542 

1.283 

3.10A 

3.15A 

3.20A 

3.20A 

3.20A 

10.8% 

10.3% 

9.2% 

8.4% 

8.1% 

3.65A 

3.46A 

3.49A 

3.55A 

3.60A 

5.9% 

7.0% 

8.8% 

9.4% 

7.6% 

10.7 

14.4 

9.5 

12.2 

lO.l 

97.0% 

83.3% 

92.8% 

94.0% 

86.3% 

333 

210 

216 

293 

140 

1.0 

1.5 

2.5 

1.0 

1.5 

3.06 

3.01 

2.6 

2.17 

1.82 
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Table  2  :  Tb’lUA  Phasing  Statistics 


Resolution 
Figure  of  Merit 
Phasing  Power 
Cullis  R-Factor 


Initial  MIR  Phases  Current  Density  Modified  Phases 
20-4.0  A  25-3.1  A 

0.390  0.645 

1.61 
0.84 


Note  : 

Initial  phases  :  Calculated  using  the  MLPHARE  program  in  the  CCP4  package 
(Otwinowski,  1991) 

Current  Phases  :  Calculated  with  the  DM  program  (Cowtan,  1994) 
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